Comparability of Mixed IC50 Data – A Statistical Analysis

نویسندگان

  • Tuomo Kalliokoski
  • Christian Kramer
  • Anna Vulpetti
  • Peter Gedeck
چکیده

The biochemical half maximal inhibitory concentration (IC50) is the most commonly used metric for on-target activity in lead optimization. It is used to guide lead optimization, build large-scale chemogenomics analysis, off-target activity and toxicity models based on public data. However, the use of public biochemical IC50 data is problematic, because they are assay specific and comparable only under certain conditions. For large scale analysis it is not feasible to check each data entry manually and it is very tempting to mix all available IC50 values from public database even if assay information is not reported. As previously reported for Ki database analysis, we first analyzed the types of errors, the redundancy and the variability that can be found in ChEMBL IC50 database. For assessing the variability of IC50 data independently measured in two different labs at least ten IC50 data for identical protein-ligand systems against the same target were searched in ChEMBL. As a not sufficient number of cases of this type are available, the variability of IC50 data was assessed by comparing all pairs of independent IC50 measurements on identical protein-ligand systems. The standard deviation of IC50 data is only 25% larger than the standard deviation of Ki data, suggesting that mixing IC50 data from different assays, even not knowing assay conditions details, only adds a moderate amount of noise to the overall data. The standard deviation of public ChEMBL IC50 data, as expected, resulted greater than the standard deviation of in-house intra-laboratory/inter-day IC50 data. Augmenting mixed public IC50 data by public Ki data does not deteriorate the quality of the mixed IC50 data, if the Ki is corrected by an offset. For a broad dataset such as ChEMBL database a Ki- IC50 conversion factor of 2 was found to be the most reasonable.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Financial Statement Comparability and the Expected Crash Risk of Stock Prices

The purpose of this study is to explain the relationship between the comparability of financial statements as a qualitative financial reporting feature with the expected risk of stock price crash. The statistical population of this research includes all companies admitted to Tehran Stock Exchange. In order to achieve the research goal, 81 companies were selected for the period between 2010 and ...

متن کامل

Comparability of Financial Reports and Negative Skewness of firm-Specific Monthly Returns: Evidence from Iranian firms

The present study aims to investigate the relationship between comparability of financial reports and negative coefficient of skewness of firm-specific monthly returns. In this study, to measure the financial statements comparability, De Franco et al. (2012) model is employed. Sample includes the 425 firm-year observations from companies listed on the Tehran Stock Exchange during the years 2013...

متن کامل

Comparative EU Statistics on Income and Living Conditions: Issues and Challenges

This paper develops and discusses a framework for the assessment of statistical quality in EU-SILC, with focus on comparability as a central dimension of quality. We view data quality as a multidimensional concept, covering not only statistical accuracy but also the relevance, timeliness, comprehensiveness, etc., of the data. There is a broad agreement on what dimensions make up the overall qua...

متن کامل

Comparability of Computer-based and Paper-based Versions of Writing Section of PET in Iranian EFL Context

Computer technology has provided language testing experts with opportunity to develop computerized versions of traditional paper-based language tests. New generations of TOEFL and Cambridge IELTS, BULATS, KET, PET are good examples of computer-based language tests. Since this new method of testing introduces new factors into the realm of language assessment ( e.g. modes of test delivery, famili...

متن کامل

Exploiting a comparability mapping to improve bi-lingual data categorization: a three-mode data analysis perspective

We address in this paper the co-clustering and co-classification of bilingual data laying in two linguistic similarity spaces when a comparability measure defining a mapping between these two spaces is available. A new approach that we can characterized as a three-mode data analysis scheme, is proposed to mix the comparability measure with the two similarity measures. Our aim is to improve join...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013